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Abstract. Open Answer Set Programming (OASP) is an attractive framework 
for integrating ontologies and rules. In general OASP is undecidable. In pre- 
vious work we provided a tableau-based algorithm for satisfiability checking 
w.r.t. forest logic programs, a decidable fragment of OASP, which has the forest 
model property. In this paper we introduce an optimized version of that algorithm 
achieved by means of a knowledge compilation technique. So-called unit com- 
pletion structures, which are possible building blocks of a forest model, in the 
form of trees of depth 1, are computed in an initial step of the algorithm. Re- 
peated computations are avoided by using these structures in a pattern-matching 
style when constructing a model. Furthermore we identify and discard redundant 
unit completion structures: a structure is redundant if there is another structure 
which can always replace the original structure in a forest model. 



1 Introduction 

Integrating Description Logics (DLs) with rules for the Semantic Web has received 
considerable attention with approaches such as Description Logic Programs [6|, DL- 
safe rules [ 12 1, VC+log 1131 . dl-programs (TJ, Description Logic Rules [10], and Open 
Answer Set Programming (OASP) |9|. OASP is a formalism which combines attractive 
features from the Logic Programming (LP) and the DL world. The syntax and semantics 
of OASP build upon the syntax and semantics of Answer Set Programming (ASP) (5): 
there is a rule-based syntax with a negation as failure operator which is interpreted via 
a stable model semantics, but unlike the LP setting, an open domain semantics, like it 
is common in the DL world, is employed. This allows for stating generic knowledge, 
without the need to mention actual constants. 

Several decidable fragments of OASP were identified by syntactically restricting 
the shape of logic programs, while carefully safe-guarding enough expressiveness for 
integrating rule- and ontology-based knowledge. A notable fragment is that of Forest 
Logic Programs (FoLPs) [8| that are able to simulate reasoning in the DL SHOQ. 
FoLPs allow for the presence of only unary and binary predicates in rules which have a 
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tree-like structure. A sound and complete algorithm for satisfiability checking of unary 
predicates w.r.t. FoLPs has been presented in (3_). The algorithm exploits the forest 
model property of the fragment: if a unary predicate is satisfiable, than it is satisfied by 
a forest-shaped model, with the predicate checked to be satisfiable being in the label 
of the root of one of the trees composing the forest. It is essentially a tableau-based 
procedure which builds such a forest model in a top-down fashion. 

In this paper we describe an optimization for reasoning with FoLPs in the form 
of a knowledge compilation technique. The technique consists in pre-computing all 
possible building blocks of the tableau, in the form of trees of depth 1, blocks which we 
call unit completion structures. The original algorithm is used for computing the unit 
completion structures. The revised algorithm matches and appends such building blocks 
until a termination condition is met, like blocking or reaching a certain depth in the 
tableau expansion. In general, not all unit completion structures have to be considered: 
inherent redundancy in a FoLP, like rules which are less general than others gives rise 
to redundancy among completion structures. A unit completion structure is redundant 
iff there is another simpler (less constrained) unit completion structure. The latter can 
replace the former in any forest model. We formalize this notion, making it possible to 
identify such redundant structures and discard them. 

The paper is structured as follows: Section|2]contains preliminaries, like the OASP 
semantics and some notation, and Section[3jintroduces the FoLP fragment. An overview 
of the original algorithm for reasoning with FoLPs is given in Section |4] The main re- 
sults of the paper concerning the computation of non-redundant unit completion struc- 
tures, and the revised algorithm, are presented in Section [5] Finally, Section [6] draws 
some conclusions and discusses future work. 

2 Preliminaries 

We recall the open answer set semantics J5J. Constants a, b,c, . . ., variables X, Y, . . ., 
terms s,t, . . ., and atoms p(ti, . . . , t n ) are as usual. A literal is an atom L or a negated 
atom not L. We allow for inequality literals of the form s =£t, where s and t are terms. 
A literal that is not an inequality literal will be called a regular literal. For a set S of 
literals or (possibly negated) predicates, S + = {a | a 6 S} and S~ = {a | not a 6 S}. 
For a set S of atoms, not S = {not a | a 6 S}. For a set of (possibly negated) 
predicates S, S(X) = {a(X) | a e S} and S(X,Y) = {a(X,Y) | a € S}. For a 
predicate p, ±p denotes p or not p, whereby multiple occurrences of ±p in the same 
context will refer to the same symbol (either p or not p). 

A program is a countable set of rules a ^— f3, where a is a finite set of regular literals 
and f3 is a finite set of literals. The set a is the head and represents a disjunction, while 
f3 is the body and represents a conjunction. If a = 0, the rule is called a constraint. 
A special type of rules with empty bodies, are so-called free rules which are rules of 
the form: q(t\, . . . , t n ) V not q(t\, . . . , t n ) ■<— , for terms t\, . . . , t n ; these kind of rules 
enable a choice for the inclusion of atoms in the open answer sets. We call a predicate q 
free if there is a q(X\, . . . , X n ) V not q(Xi, . . . , X n ) with variables X%, . . . , X n . 
Atoms, literals, rules, and programs that do not contain variables are ground. For a rule 
or a program R, let cts(R) be the constants in R, vars(R) its variables, and preds(R) 



its predicates with upreds(R) the unary and bpreds(R) the binary predicates. For every 
non-free predicate q and a program P, P q is the set of rules of P that have q as a head 
predicate. A universe U for P is a non-empty countable superset of the constants in P: 
cts(P) C J7. We call Pjj the ground program obtained from P by applying all possible 
substitutions of variables by elements of U to every rule in P. Let Bp (Cp) be the set 
of regular atoms (literals) that can be formed from a ground program P. 

An interpretation I of a ground P is a subset of £>p. We write / |= p(t\, . . . , t n ) 
if p(t\, . . . , i„) G J and / |= not p(ii, . . . , i„) if / ^= p(ti, ■ ■ ■ , t n )- Also, for ground 
terms s, t, we write I \= s ^ t if s ^ t. For a set of ground literals L, I \= L if J |= I 
for every I G L. A ground rule r : a <— (3 is satisfied w.r.t. /, denoted / |= r, if J |= I 
for some £ G a whenever I \= ft. A ground constraint <— /3 is satisfied w.r.t. / if / ^= /3. 

For a positive ground program P, i.e., a program without not, an interpretation J of 
P is a model of P if J satisfies every rule in P; it is an answer set of P if it is a subset 
minimal model of P. For ground programs P containing not, the GL-reduct 10 w.r.t. 
/ is defined as P 1 , where P 1 contains a + <— f3 + for a <— f3 in P, / |= not f}~ and 
/ |= a - . / is an answer sef of a ground P if / is an answer set of P 1 . 

A program is assumed to be a finite set of rules; infinite programs only appear as 
byproducts of grounding with an infinite universe. An open interpretation of a program 
P is a pair (U, M) where U is a universe for P and M is an interpretation of Pjj. An 
open answer set of P is an open interpretation (U, M) of P with M an answer set of 
Pjj. An n-ary predicate p in P is satisfiable if there is an open answer set (U, M) of P 
s. t. . . . , x n ) G M, for some xi, . . . , x n G £/. 

We introduce notation for trees which extend those in [ 14 1. Let • be a concatenation 
operator between sequences of constants or natural numbers. A tree T with root c (T c ), 
where c is a specially designated constant, has as nodes sequences of the form c-s, where 
s is a (possibly empty) sequence of positive integers formed with the concatenation 
operator; for x ■ d G T, d G N*, we have that x G T. The set At = {(x,y) \ x,y G 
T, 3n G N* : y = x ■ n} is the set of arcs of a tree T. For x,y G T, we say that a; <t y 
iff a: is a prefix of y and x ^ y. 

A forest F is a set of trees {T c | c G C}, where C is a set of distinguished constants. 
We denote with Np = Upe fT and Ap = Upe fAt the set of nodes and the set of arcs 
of a forest P, respectively. Let <p be a strict partial order relationship on the set of 
nodes Np of a forest F where x <f y iff x <p y for some tree T in P. An extended 
forest EF is a tuple (P, ES) where F = {T c | c G C} is a forest and ES C N F x C. 
We denote by A^s_f = A^f the nodes of PP and by Aef — AfLI ES its arcs. So unlike 
a normal forest, an extended forest can have arcs from any of its nodes to any root of 
some tree in the forest. 

Finally, for a directed graph G, pathsa is the set of pairs of nodes for which there 
exists a path in G from the first node in the pair to the second one. 

3 Forest Logic Programs 

Forest Logic Programs (FoLPs) [8| are logic programs with tree-shaped rules which 
allow for constants and for which satisfiability checking under the open answer set 
semantics is decidable. 



Definition 1. A forest logic program (FoLP) is a program with only unary and bi- 
nary predicates, and such that a rule is either a free rule a(s) V not a(s) or 
f(s, t) V not f(s, t) <— , where s and t are terms such that if s and t are both vari- 
ables, they are different, a unary rule 

r : a(s) <- f3(s), (j m (s, t m ),5 m (t m ))i< m < k ,il) (1) 

where s and t m , 1 < m < k, are terms (again, if both s and t m are variables, they are 
different; similarly for U and tj), where 

- i> Q Ui<i^j<fe{*i tj} and{^}Pij m = for 1 < m < k, 

- Vti G vars(r) : 7+ 7^ 0, i.e., for variables U there is a positive atom that connects 
s and ti, 

or a binary rule 

/(M)«-0(a), 7 (M),5(t) (2) 

with {^} n 7 = and 7+ 7^ if t is a variable (s and t are different if both are 
variables), or a constraint <— a(s) or f(s, t) where s and t are different if both are 
variables). 

The following program P is a FoLP which says that an individual is a special mem- 
ber of an organization (smembef) if it has the support of another special member: rule 
n, or if it has the support of two regular members of the organization (member): rule 
r 2 . The binary predicate support which describes the 'has support' relationship is free. 
Nobody can be at the same time both a special member or a regular member: constraint 
n. Two particular regular members are a and b: facts r 5 and r 6 . 

Example 1. 



rt : smember(X) support (X , Y), smember(Y) 

V2 : smember(X) support (X , Y),rmember(Y), 

support(X, Z), rmember(Z), Y =^ Z 

rs : support(X , Y) V not support (X , Y) 

r Jf : *r- smember(X), rmember(X) 

r 5 : rmember(a) 
re : rmember(b) 



As their name suggests FoLPs have the forest model property: 

Definition 2. Let P be a program. A predicate p G upreds(P) is forest satisfiable w.r.t. 
P if there is an open answer set (U, M) of P and there is an extended forest EF = 
({T e } U {T a I a G cts(P)}, ES), where e is a constant, possibly one of the constants 
appearing in P , and a labeling function C : {T £ } U {T a | a G cts(P)} U Aef — > 

2preds(P) s t 

- U = Nef, and 

- V e C(e), 

- z ■ i ET G EF, i > 0, iff there is some f(z, z ■ i) G M, z G T, and 

- for y G T G EF, q G upreds(P), f G bpreds(P), we have that 



• q(y) G M iffq £ C(y), and 

• f(y,u)€Miff{u = y-iVuG cts(P)) A / G C(y, u). 



We call such a (U, M) a forest model and a program P has the forest model prop- 
erty if the following property holds: if p £ upreds(P) is satisfiable w.r.t. P then p is 
forest satisfiable w.r.t. P. 

Consider the FoLP P introduced in Example Q] The unary predicate smember 
is forest satisfiable w.r.t. P: ({a,b,x}, {r ■member '(a), rmember(b), support(x, a), 
support(x, b), smember(x)}) is a forest model in which smember appears in the la- 
bel of the (anonymous) root of one of the trees in the forest (see Figure 1). Note that 
in the ordinary LP setting, where one restricts the universe to the Herbrand universe, 
smember is not satisfiable. 

{smember} 

{rmember} a < x >■ b{rmember} 

{support} {support} 

Fig. 1. A Forest Model for P 

4 An Algorithm for Forest Logic Programs 

In this section, we give an overview of the tableau algorithm for satisfiability checking 
for FoLPs introduced in (3). For technical details we refer the reader to the original 
paper. We use as a running example the FOLP from Example Q] Constraints are not 
treated explicitly in the algorithm as they can be simulated using unary rules. As such, 
the constraint : <— smember (X), rmember (X) in Example Q] is replaced with 
r'j : co(X) ■<— not co(X), smember(X), rmember(X), with co a new predicate. 

The basic data structure used by the algorithm is a so-called completion structure. 
It contains an extended forest EF, whose set of nodes constitutes the universe of the 
model, and a labeling function CT (content), which assigns to every node, resp. arc of 
EF, a set of possibly negated unary, resp. binary predicates. The presence of a predicate 
symbol pj not p in the content of some node or arc x indicates the presence/absence of 
the atom p(x) in the open answer set. The presence (absence) of an atom in the open 
answer set is justified by imposing that the body of at least one ground rule which has 
the respective atom in the head is satisfied (no body of a rule which has the respective 
atom in the head is satisfied). In order to keep track which (possibly negated) predicate 
symbols in the content of some node or arc have already been expanded a completion 
structure contains also a so-called status function ST. Furthermore, in order to ensure 
that no atom in the partially constructed open answer set is circularly motivated, i.e. the 
atoms are well-supported [2], a graph G which keeps track of dependencies between 
atoms in the (partial) model is maintained. 

Definition 3. An A\-completion structure for a FoLP is a tuple (EF, CT, ST, G) 
where: 

1 We use the prefix Ai to denote completion structures computed using this original algorithm 
as opposed to completion structures computed using the optimised algorithm described in the 
next section for which we will use the prefix Ai . 



- EF = (F, ES) is an extended forest, 

- CT : N EF U A EF -> 2P reds ^ Unot ipreds(P)) is fa 'content' function, 

- ST : {(x, ±g) | ±g G CT(x),x G jV BF U — > {exp,unexp} is the 'status' 
function, 

- G = (V, A) is a directed graph which has as vertices atoms in the answer set in 
construction: V C 3 Pn ef ■ 

An initial A\ -completion structure for checking satisfiability of a unary predicate 
pw.r.t. a FoLP P is a completion structure (EF, CT, ST, G) with EF = (F, 0), F = 
{T £ } U {T a | a G cts(P)}, where e is a constant, possibly in cts(P), T x = {x}, for 
x G {e} U cts(P), G = (V, 0), V = {p(e)}, and CT(e) = {p}, ST (s,p) = unexp. 

An extended forest is initialized with single-node trees having as roots constants 
from P and, possibly, a new single-node tree with anonymous root. The forest model 
from Figure 1 has been evolved from an initial completion structure which has as e, the 
root element where smember has to be satisfied, the anonymous individual, x. There 
are two other single-node trees: T a and The predicate smember in the content of x 
is marked as unexpanded and G is a graph with a single vertex smember (x). 

{smember u } 

EF . a {} x b{} 

V: smember(x) 
A: 

An initial A\ -completion structure for checking the satisfiability of a unary predi- 
cate p w.r.t. a FoLP P is evolved by means of expansion rules to a complete clash-free 
structure that corresponds to a finite representation of an open answer set in case p is 
satisfiable w.r.t. P. Applicability rules govern the application of the expansion rules. 



4.1 Expansion Rules 

In the following, for a completion structure (EF, ST, CT, G), let x G Nef and (x, y) G 
Aef be the node, resp. arc, under consideration. 

(i) Expand unary positive. For a unary positive (non-free) p G Ct(x) s. t. St(x, p) — 
unexp, choose a unary rule r G P p for which s, the head term, matches x; ground this 
rule by substituting s with x, in case s is a variable, and every successor term t m which 
is a variable with a successor of x in EF s. t. the inequalities in ip are satisfied (if 
needed one can introduce new successors of x in EF, either as successors of x in T, 
where x G T, or in the form of constants from P). We motivate the presence of p(x) in 
the open answer set by enforcing its body to be satisfied by inserting appropriate (possi- 
bly negated) predicate symbols in the contents of nodes/arcs of the structure. The newly 
inserted predicate symbols are marked as unexpanded and G is updated, by adding arcs 
from p(x) to every body atom. 

In our example, smember is unexpanded in the initial completion structure. Rule r-i 
is chosen to motivate the presence of smember(x) in the open answer set. It is grounded 
by substituting X with x, and Y\ and Y 2 with a and b, respectively: smember -(x) <— 



support (x, a) , rmember(x, a), support(x, b), rmember(x, b). We enforce the body 
of this ground rule to be true and obtain the following completion structure (note also 
that G has been updated): 



{rmember u }a < 



{smember 3 } 

x 



b{rmember u } 



EF: 



{support™} 



{support™} 



V : smember(x) , support (x , a), support(x, b), rmember (x, a), rmember(x, b) 
A : smember(x) — > support(x,a),smember(x) —¥ support(x,b), 

smember(x) — > rmember(x,a), smember(x) — > rmember(x,b) 

All currently unexpanded predicates, i.e., support in the content of arcs (x, a) and 
(x, b), and rmember in the content of nodes a and b, can be trivially expanded as 
support is a free predicate and 7-5 and r§ are facts. However one still has to ensure that 
the structure constructed so far can be extended to an actual open answer set, i.e., it is 
consistent with the rest of the program. The following expansion rule takes care of this. 

(U) Choose a unary predicate. If all predicates in ct(x) and in the contents of x's 
outgoing edges are expanded and there are still unary predicates p which do not appear 
in Ct(x), pick such a p and inject either p or not p in ct(x). The intuition is that one 
has to explore all unary/binary predicates at every node/arc as some predicate which 
is not reachable by dependency-directed expansion can prohibit the extension of the 
partially constructed model to a full model. Consider the simple case where there is a 
predicate p defined only by the rule: p <— not p and ±p does not appear in the body 
of any other rule. The program is obviously inconsistent, but this cannot be detected 
without trying to prove that p is or is not in the open answer set. 

In our example, one does not know whether co or not co belongs to Ct(x). We 
choose to inject not co in ct(x) and mark it as unexpanded. 

(Hi) Expand unary negative. Justifying a negative unary predicate not p € Ct(x) 
means refuting the body of every ground rule which defines p(x), or in other words 
refuting at least a literal from the body of every ground rule which defines p(x). For 
more technical details concerning this rule we refer the reader to Q. 

In our example, the unexpanded predicate in Ct(x), not co, is defined by one rule, 
r'4, whose only possible grounding is co(x) <r- not co(x), smember (x), rmember (x). 
Refuting the body of this rule amounts to inserting not rmember in CT(a;) (smember 
and not co are already part of the content of that node). At its turn, the presence of 
not rmember in Ct(x) has to be motivated by using the expand unary negative rule, 
and the process goes on. Finally, we obtain a completion structure in which no expan- 
sion rule is further applicable and which represents exactly the forest model from Figure 
1 (smember and rmember are abbreviated with sm and rm, respectively): 



EF: 



a < 

{rm, not sm, not co} 



{sm, not rm, not co} 

x 

{support} {support} 



{rm, not sm, not co} 



b 



V : sm(x), support(x,a), support(x,b),rm(x,a), rm(x,b) 

A : sm(x) — s> support(x,a),sm(x) — > support(x,b), sm(x) — > rm(x,a), sm(x) —> rm(x, 



Similarly to rules (i), (ii), and (iii) we define the expansion rules for binary predi- 
cates: (iv) Expand binary positive, (v) Expand binary negative, and (vi) Choose binary. 

4.2 Applicability Rules 

The applicability rules restrict the use of the expansion rules. 

(vii) Saturation. A node x G Nef is saturated if for all p G upreds(P), p G Ct(x) 
or not p G Ct(;e), and no ±q G Ct(:e) can be expanded with rules (i-iii), and for 
all (x,y) G Aef and p G bpreds(P), p G CT(x,y) or no£ p G Ct(x, j/), and no 
±/ G Ct(x, y) can be expanded with (iv-vi). No expansions should be performed on a 
node from Nef which does not belong to cts(P) until its predecessor is saturated. 

(viii) Blocking. A node x G Nef is blocked if there is an ancestor y of x in F, 
y <f x, y ^ cts(P), s. t. Ct(.t) C Ct(j/) and the set pathsa(y,x) — {(p,q) 
(p(y)i<l{ x )) € pathsc A q is not free} is empty. We call (y,x) a blocking pair. No 
expansions can be performed on a blocked node. One can notice that subset blocking 
is not enough for pruning the tableau expansion. Every atom in the open answer set 
has to be finitely motivated [7, Theorem 2]: in order to ensure that, one has to check 
that there is no dependency in G between an atom formed with the blocking node and 
an atom formed with the blocked node. The extra condition makes the blocking rule 
insufficient to ensure the termination of the algorithm. The following applicability rule 
ensures termination. 

Example 2. Consider a restricted version of P from Example Q] which contains only 
rules and r%. By checking satisfiability of smember w.r.t. the new program one ob- 
tains the following completion structure: 

EF: V : {smember (x), smember (y)} A : {smember (x) —¥ smember (y)} 

x {smember} 

{support} 
y {smember} 

While the contents of nodes x and y are identical, they do not form a blocking pair as 
there is an arc in G between smember(x) and smember(y): unfolding the structure 
(justifying y similarly as x) would lead to an infinite chain: smember (x), smember (y), 
smember (z), . . . , in the atom dependency graph of the grounded program. 

(ix) Redundancy. A node x G Nef is redundant if it is saturated, it is not blocked, 
and there are k ancestors of x in F, {yi)\<i<k, with k = 2 P (2 P — 1) + 3, and p = 
\upreds(P)\, s. t. CT(a;) = CT(yj). In other words, a node is redundant if it is not 
blocked and it has k ancestors with content equal to its content: any forest model of 
a FoLP P which satisfies p can be reduced to another forest model which satisfies p 
and has at most k + 1 nodes with equal content on any branch of a tree from the forest 
model, and furthermore the (k + l)st node, in case it exists, is blocked [3]. One can thus 
search for forest models only of the latter type. As such the detection of a redundant 
node constitutes a clash and stops the expansion process. 



4.3 Termination, Soundness, Completeness, Complexity Results 

An A\ -completion structure is contradictory if for some x G Nef/Aef and p G 
upreds(P)/ bpreds(P), {p, not p} C ct(x). An ^-completion structure for a FoLP 
_P and ap 6 upreds (P) is complete if it is a result of applying the expansion rules to 
the initial completion structure for p and P, taking into account the applicability rules, 
s. t. no expansion rules can be further applied. 

Also, a complete Ai -completion structure CS = (EF, CT, ST, G) is Ai-clash-free 
if: (1) CS is not contradictory (2) EF does not contain redundant nodes (3) G does 
not contain cycles (4) there is no p G upreds (P) / 'bpreds(P) and x G Nef /Aef, % 
unblocked, s.t. p G CT(ie), and ST(x,p) = unexp. 

It has been shown that an initial At -completion structure for a unary predicate p and 
a FoLP P can always be expanded to a complete ^-completion structure (termination), 
that, if p is satisfiable w.r.t. P, there is a complete clash-free A\ -completion structure 
(soundness), and, finally, that, if there is a complete clash-free A\ -completion structure, 
p is satisfiable w.r.t. P (completeness). 

In the worst case the algorithm runs in double exponential time, and a complete 
completion structure has a double exponential number of nodes in the size of the pro- 
gram. The high complexity is mostly due to the fact that blocking is not enough to 
ensure termination, and that, in particular, "anywhere blocking"! 1 1 1 cannot be used as 
a termination technique. As already explained this peculiarity appears as a result of 
adopting a minimal model semantics. 

5 Optimized Reasoning with FoLPs 

This section presents a knowledge compilation technique for reasoning with FoLPs to- 
gether with an algorithm which makes use of this pre-compiled knowledge. The main 
idea is to capture all possible local computations, which are typically performed over 
and over again in the process of saturating the content of a node, by pre-computing 
all possible completion structures of depth 1 using the original algorithm described in 
the previous section. In the new algorithm, saturating the content of a node reduces to 
picking up one of the pre-computed structures which satisfies the existing constraints 
regarding the content of that node and appending the structure to the completion in con- 
struction: such constraints are sets of unexpanded (possibly negated) predicates which 
are needed to motivate the presence/absence in the open answer set of atoms constructed 
with the current node and the node above it. 

Picking up a certain unit completion structure to saturate a node can impose strictly 
more constraints on the resulted structure than picking another unit completion structure 
with the same root content. Such constraints refer to: (1) the contents of the successor 
(non-blocked) nodes in a unit completion structure; (2) the paths from an atom formed 
with the root node of a unit completion to an atom formed with a successor node of 
such a completion - the more paths there are the harder blocking becomes. We discard 
such structures which are strictly more constraining than others, as they can be seen as 
redundant building blocks for a model. 

The rest of the section formalizes and exemplifies these notions. 



5.1 Unit Completion Structures 

As mentioned in the introduction of this section, the intention is to obtain all completion 
structures of depth 1 which can be used as building blocks in our algorithm. We call 
such structures unit completion structures. The skeleton of such a structure, is a so- 
called initial unit completion structure. If they are to be used as building blocks in 
the algorithms, unit completion structures have to have as backbones trees of depth 1, 
and not forests. Hence, an initial unit completion structure is defined as a tree (unlike 
its counterpart notion from the previous section, initial completion structure, which is 
defined as a forest) with a single node, the root, which is either an anonymous constant 
or one of the constants already present in the program. The content of the root is empty. 

Definition 4. An initial unit completion structure for a FoLP P is a completion struc- 
ture (EF, CT, ST, G) with EF = (F, ES), F = {T £ }, where e is a constant, possibly 
in cts(P), T e = {e}, ES = $, G = (V, A), V = 0, A = 0, and Ct(s) = 0. 

A unit completion structure captures a possible local computation: that is, it is ob- 
tained as an expansion of an initial unit completion structure, to a tree of depth 1 . 

Definition 5. A unit completion structure (EF, CT, ST, G) for a FoLP P, with EF = 
({T e }, ES), is an A\-completion structure derived from an initial unit completion 
structure by application of the expansion rules (i)-(vi) described in Section \4~J] ac- 
cording to the applicability rules introduced in Section \4.2\ which has been expanded 
such that e is saturated and for all s such that e ■ s £ T e , and for all ip £ CT(e ■ s), 
ST(±p, e ■ s) = unexpQ 

Example 3. Consider the program Pr: 

p(X) <- notp(X) 
p(X)<-f(X,Y),not q(Y) 
p(X)^f(X,Y),p(Y) 
p(X)^f(X,Y),not q(Y),p(Y) 
q(X)^f(X,Y),notp(Y) 
f(X, Y)Vnotf(X, Y) <- 

Figure 2 depicts three unit completion structures for Pr. They all have the same content 
for the root node: {p, not q}. The presence of p in the content of the root node has been 
motivated in the first structure by means of rule r^, in the second structure by means of 
rule r3, and in the third structure by means of rule r<z. The different ways to derive p 
lead to different sets of arcs in the dependency graphs belonging to each structure. On 
the other hand, to motivate that not q is in the content of the root node, in each case 
it was shown that the body of r$ grounded such that X is instantiated as the root node 
and Y as the successor node is not satisfied, or more concretely the presence of p in the 
content of the successor node was enforced in each case (not f could not be used to 
invalidate the triggering of the rule as / was already present in the content of the arc 
from the root node to the successor node in each case). 

2 The status function is relevant only in the definition/construction of a unit completion structure, 
but not in the context of using such structures. As such, we will denote a unit completion 
structure in the following as a triple (EF, CT, G). 
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Fig. 2. Three unit completion structures for Pr: UC\, UC2, and U C3. 



One can notice that while the content of the successor node is included in the content 
of the root node in each of the cases, only for UC3, the two nodes form a blocking pair 

as pathsa 3 (c,cl) = 0. 

Definition 6. A unit completion structure is final iff all its successor nodes are blocked, 
or they have empty contents. 

Proposition 1. A final unit completion structure is a complete clash-free Ai-completion 
structure. 

In our example UC3 is a final unit completion structure, and thus also a complete 
clash-free Ai -completion structure. 

Proposition 2. There is a deterministic procedure which computes all unit completion 
structures for a FoLP P in the worst-case scenario in exponential time in the size of P. 

Proof Sketch. We consider the transformation of the non-deterministic algorithm 
described in Definition [3] into a deterministic procedure. There are at most 2™ different 
values for the content of a saturated node, where n = \upreds(P)\. Justifying the pres- 
ence of a predicate symbol p in the content of a node takes in the worst case polynomial 
time, but there is an exponential number of choices to do this (an exponential number 
of possible groundings for every rule). Justifying the presence of a negated predicate 
symbol not p in the content of a node takes in the worst case exponential time (all pos- 
sible groundings of every rule r £ P p have to be considered), while at every step of 
the computation there is a polynomial number of choices. Overall, such a deterministic 
procedure runs in exponential time in the worst case scenario. □ 

5.2 Redundant Unit Completion Structures 

As seen in Example [3] there are unit completion structures with roots with equal con- 
tent, but possibly different topologies, contents of the successor nodes and/or possibly 



different dependency graphs. As discussed in the introduction to this section it is worth- 
while to identify structures which are strictly more constraining than others, in the sense 
that they impose more constraints on the content of the successor nodes of the struc- 
ture and introduce more paths in the dependency graph as they can be discarded. The 
following definition singles out such redundant structures. 

Definition?. A unit completion structure UC\ = {EF\, CTi, G\), with EF\ = 
({T £l }, ESi), is said to be redundant iff there is another unit completion structure 
UC 2 = (EF 2 , CT 2 , G 2 ), with EF 2 = ({T £2 }, ES 2 ) s. t.: 

- if e 2 e cts(P), then e 2 = e\; 

- CT(ei) = CT(e 2 ); 

- if £2 • Si, • • • , £2 • si are the non-blocked successors of e 2 , there exist I distinct 
successors E\ ■ t\, . . . , S\ ■ ti ofe\ such that: 

• CT(e 2 • Si) C CT(ei • ti), for every 1 < i < I, and 

• pathsa 2 (e 2 ,e 2 • Sj) C pathsc 1 (s:i,si ■ U), for every 1 < i < I, 
with at least one inclusion being strict. 

Considering the previous example, one can see that UC\, and UC 2 are redundant 
structures, while UC3 is not, as U C\ is more constraining than UC 2 , and UC 2 at its 
turn is more constraining than UC3. 

Proposition 3. Computing the set of non-redundant unit completion structures for a 
FoLP P can be performed in the worst case in exponential time in the size of P. 

Proof Sketch. The result follows from the fact that there is an exponential number 
of unit completion structures for a FoLP P in the worst case scenario. □ 

5.3 Reasoning with FoLPs Using Unit Completion Structures 

We define a new algorithm which uses the set of pre-computed non-redundant com- 
pletion structures. We call this algorithm A 2 . As in the case of the previous algorithm, 
A 2 starts with an initial A 2 -completion structure for checking satisfiability of a unary 
predicate p w.r.t. a FoLP P and expands this to a so-called A 2 -completion structure. 

An A 2 -completion structure (EF , CT, ST, G) is defined similarly as an A\ -completion 
structure, but the status function has a different domain, the set of nodes of the forest: 
ST : Nef ^ {exp, unexp}. 

An initial A 2 -completion structure for a unary predicate p and FoLP P is defined 
similarly as an initial Ai -completion structure forp and P, but in this case every node 
in the extended forest is marked as unexpanded: St(x) = unexp, for every x <E Nef- 

The difference in the definition of an ^-completion structure compared to its A\ 
homonym is due to the fact that in this scenario nodes are expanded by matching their 
content with existent unit completion structures, and not predicates in the content of 
nodes. We make explicit the notion of matching the content of a node with a unit com- 
pletion structure by introducing a notion of local satisfiability: 

Definition 8. A unit completion structure U C for a FoLP P, {EF , CT, G), with EF = 
({T e }, ES), locally satisfies a (possibly negated) unary predicate p iff p G CT(e). Simi- 
larly, UC locally satisfies a set S of (possibly) negated unary predicates iff S C CT(e). 



All three unit completions in Figure 2 locally satisfy the set {a, not b}. It is easy 
to observe that if a unary predicate p is not locally satisfied by any unit completion 
structure UC for a FoLP P (or equivalently not p is locally satisfied by every unit 
completion structure), p is unsatisfiable w.r.t. P. However, local satisfiability of a unary 
predicate p in every unit completion structure for a FoLP P does not guarantee 'global' 
satisfiability of p w.r.t. P (as in the case of the program in Example [2] whose only unit 
completion structure was the one depicted in that example). 

When building an _4 2 -completion structure CS = (EF, CT, ST, G), with G = 
(V, A), for a FoLP P by using unit completion structures as building blocks an operation 
commonly appears: the expansion of a node x £ Nef by addition of a unit completion 
structure UC = {EF , CT, C), with EF = {{T E },ES') and G' = {V',A'), which 
locally satisfies Ct(.t), at x, given that its root matches with £0. We call this operation 
expandcs{x, UC). Formally, its application updates CS as follows: 

- ST(x)=exp, 

- N EF = N EF U {x ■ s \ s ■ s £ T e }, 

- A E f = A E f U {{x, x ■ s) | (e, e ■ s) £ A EF > }, 

- Ct(x) = CT(e). For all s such that e ■ s £ T £ , Ct(x • s) = CT(e • s), 

- V = V U {p(x) | p £ CT(e)} U {p(x ■ s) I p £ CT(e • s)}, 

- A = A U {(p(z), q(y)) \ (p(z), l{y)) £ A}, where e = x, and 'e T l — x ■ s. 

The revised algorithm has a new rule which we call Match. This rule replaces the 
expansion rules (i)-(vi) and the applicability rule (vii) from the original algorithm. 

Match. For a node x £ Nef- if St(se) = unexp choose a non-redundant unit 
completion structure UC with root matching x which satisfies ct(x) and perform 
expandcs( x , UC). 

However, rules (viii) Blocking and (ix) Redundancy are still used. 

Definition 9. A complete ^-completion structure for a FoLP P and ap £ upreds (P), 
is an A2-completion structure that results from applying the rule Match to an initial A2- 
completion structure for p and P, taking into account the applicability rules (viii) and 
(ix), s. t. no other rules can be further applied. 

The local clash conditions regarding contradictory structures or structures which 
have cycles in the dependency graph G are no longer relevant: 

Definition 10. A complete A2-completion structure CS — {EF ', CT, ST, G) is clash- 
free if (1) EF does not contain redundant nodes (2) there is no node x £ Nef, % 
unblocked, s.t. st(x) — unexp. 

Termination follows from the usage of the blocking and of the redundancy rule: 

Proposition 4. An initial A2-completion structure for a unary predicate p and a FoLP 
P can always be expanded to a complete A2-completion structure. 

3 An anonymous individual matches with any term, while a constant matches only with itself; 
thus, unit completion structures with roots constants can only be used as initial building blocks 
for trees with roots the corresponding constants. 



The algorithm is sound and complete: 



Proposition 5. A unary predicate p is satisfiable w.r.t. a FoLP P iff there is a complete 
clash-free A2-completion structure. 

Proof Sketch. Soundness of A2 follows from soundness of A\. any completion 
structure computed using A2 could have actually been computed using A\ by replacing 
every usage of the Match rule with the corresponding rule application sequence used by 
A\ to derive the unit completion structure which is currently appended to the structure. 

Completeness of A2 follows from completeness of A\ : any clash-free complete A\- 
completion structure can be seen as a complete clash-free ^-completion structure. It 
is essential here that the discarded unit completion structures were strictly more con- 
straining than some other (preserved) unit completion structures. Whenever the expan- 
sion of a node in the complete clash-free A\ -completion structure has been performed 
by a sequence of rules captured by a redundant unit completion structure, it is possible 
to construct a complete clash-free A2 -completion structure by using the simpler non- 
redundant unit completion structure instead. □ 

As we still employ the redundancy rule in this version of the algorithm, a complete 
A2 -completion structure has in the worst case a double exponential number of nodes in 
the size of the program. As such: 

Proposition 6. A2 runs in the worst-case in double exponential time. 

6 Discussion and Outlook 

Our optimized algorithm runs in the worst case in double exponential time: this is not a 
surprise as the scope of the technique introduced here is saving time by avoiding redun- 
dant local computations. The worst-case running complexity of the algorithm depends 
on the depth of the trees which have to be explored in order to ensure completeness of 
the algorithm and on the fact that anywhere blocking is not feasible. Even with classical 
subset blocking one has to explore an exponential number of nodes across a branch in 
order for the algorithm to terminate. Thus, the only factor which would improve the 
worst-case performance is finding a termination condition which considers nodes in 
different branches. At the moment this seems highly unattainable. 

The next step of our work is the evaluation of the new algorithm. We expect it 
will perform considerably better than the original algorithm in returning positive an- 
swers to satisfiability checking queries, while it might still take considerable time in the 
cases where a predicate is not satisfiable. Especially problematic are cases like the one 
described in Example [2] where there exists a unit completion structure which locally 
satisfies the predicate checked to be satisfiable, but the predicate is actually unsatisfi- 
able. An obvious strategy for implementation is to establish a limit on the depth of the 
explored structures: in practice it is highly improbable that if there exists a solution, it 
can be found only in an open answer set of a considerable size: actually, it is quite hard 
to come up with examples of such situations. 

We note that there are also related optimization approaches to ours which do not 
improve on the worst-case complexity of algorithm, but, which in practice prove to be 



considerably better than the original algorithms. A knowledge compilation technique 
for reasoning with the DL ACC is described in \4\. First, all sub-concepts of a concept 
which are conjunctions of simple concepts and role restrictions are computed in the 
form of so-called paths which are sets of simple concepts and role restrictions. Paths 
which contain contradictory concepts (links) are removed, as well as paths which are 
super-sets of other paths: this is similar to our method in what concerns removing lo- 
cal contradictions and redundancy. However, we also remove redundancies in the set 
of dependencies between atoms in the model. Then, role restrictions are considered: all 
links for 'potentially reachable' concepts from the original concept are removed and a 
so-called linkless graph is obtained. Unlike that, we investigate only structures of depth 
1 : we consider that pre-computing structures with higher depth would be an overkill. 
The linkless graph is exploited for checking concept consistency and answering sub- 
sumption queries. Both reasoning tasks take in the worst case exponential time. 
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